Blind Men and the Elephant: Piecing Together Hadoop for Diagnosis

نویسندگان

  • Xinghao Pan
  • Jiaqi Tan
  • Soila Pertet
  • Rajeev Gandhi
  • Priya Narasimhan
چکیده

Google’s MapReduce framework enables distributed, data-intensive, parallel applications by decomposing a massive job into smaller (Map and Reduce) tasks and a massive data-set into smaller partitions, such that each task processes a different partition in parallel. However, performance problems in a distributed MapReduce system can be hard to diagnose and to localize to a specific node or a set of nodes. On the other hand, the structure of large number of nodes performing similar tasks naturally affords us opportunities for observing the system from multiple viewpoints. We present a “Blind Men and the Elephant” (Blimey) framework in which we exploit this structure, and demonstrate how problems in a MapReduce system can be diagnosedůů by corroborating the multiple viewpoints. More specifically, we present algorithms within the Blimey framework based on OS-level performance counters, on white-box metrics extracted from logs, and on application-level heartbeats. We show that our Blimey algorithms are able to capture a variety of faults including resource hogs and application hangs, and to localize the fault to subsets of slave nodes in the MapReduce system. In addition, we discuss how the diagnostic algorithms’ outcomes can be further synthesized in a repeated application of the Blimey approach. We present a simple supervised learning technique which allows us to identify a fault if it has been previously observed. Keywords-MapReduce, Hadoop, Failure Diagnosis

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The blind men and the elephant.

It was six men of Indostan To learning much inclined, Who went to see the Elephant (Though all of them were blind), That each by observation Might satisfy his mind. The First approached the Elephant, And happening to fall Against his broad and sturdy side, At once began to bawl: "God bless me! but the Elephant Is very like a wall!" The Second, feeling of the tusk, Cried, "Ho! What have we here?...

متن کامل

The Genome: An Outsider's View

The Buddha once told a story about a king who ordered a group of blind men to be presented with an elephant. Each man touched a different part of the animal. The king then asked them what an elephant is like. The blind men who touched the elephant’s head replied, ‘‘An elephant, your majesty, is just like a water jar.’’ The blind men who touched its ear said, ‘‘An elephant, your majesty, is just...

متن کامل

Video in science

The Chinese folktale, Three Blind Men and an Elephant, is more than 2,000 years old. Three blind men approached an elephant and tried to describe it. The first man felt the elephant’s ear and decided that it was like a fan. The second man touched the elephant’s knee and decided that it was like a tree. The third man held the elephant’s trunk and concluded that it was like a snake. They could no...

متن کامل

Local Information, Observable Parameters, and Global View

SUMMARY The " Blind Men and an Elephant " is an old Indian story about a group of blind men who encounter an elephant and do not know what it is. This story describes the difficulties of understanding a large concept or global view based on only local information. Modern technologies enable us to easily obtain and retain local information. However, simply collecting local information does not g...

متن کامل

Adaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments

Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009